Active Imitation Learning via State Queries

نویسندگان

  • Kshitij Judah
  • Alan Fern
چکیده

We consider the problem of active imitation learning. In passive imitation learning, the goal is to learn a target policy by observing full trajectories of it. Unfortunately, generating such trajectories requires substantial effort and can be impractical in some cases. Active imitation learning reduces this effort by querying the teacher about individual states. Given such a query, the teacher may either suggest an action for the state or declare that the state is “bad” in the sense that the teacher’s policy would never go there. Standard active-learning techniques, do not account for the state-visitation likelihood of the target policy, and hence can perform poorly by asking many “bad” queries. We describe a new approach to this problem that is inspired by viewing query selection in the framework of Bayesian active learning, resulting in a version of Query-by-Committee for active imitation learning. Our experiments in two test domains show promise for our approach compared to a number of alternatives.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Active Imitation Learning of Hierarchical Policies

In this paper, we study the problem of imitation learning of hierarchical policies from demonstrations. The main difficulty in learning hierarchical policies by imitation is that the high level intention structure of the policy, which is often critical for understanding the demonstration, is unobserved. We formulate this problem as active learning of Probabilistic State-Dependent Grammars (PSDG...

متن کامل

Active Imitation Learning via Reduction to I.I.D. Active Learning

In standard passive imitation learning, the goal is to learn a target policy by passively observing full execution trajectories of it. Unfortunately, generating such trajectories can require substantial expert effort and be impractical in some cases. In this paper, we consider active imitation learning with the goal of reducing this effort by querying the expert about the desired action at indi...

متن کامل

Active lmitation learning: formal and practical reductions to I.I.D. learning

In standard passive imitation learning, the goal is to learn a policy that performs as well as a target policy by passively observing full execution trajectories of it. Unfortunately, generating such trajectories can require substantial expert effort and be impractical in some cases. In this paper, we consider active imitation learning with the goal of reducing this effort by querying the exper...

متن کامل

Active Imitation Learning

Imitation learning, also called learning by watching or programming by demonstration, has emerged as a means of accelerating many reinforcement learning tasks. Previous work has shown the value of imitation in domains where a single mentor demonstrates execution of a known optimal policy for the benefit of a learning agent. We consider the more general scenario of learning from mentors who are ...

متن کامل

RobotCub Activity Report

ion Levels for Robotic Imitation: Overview and Computational Approaches. Manuel Lopes, LuisMontesano, Francisco Melo and José Santos-Victor, in From motor to interaction learning in robots, inpress, Springer Verlag, 2009. Theoretical tools for learning from imitation – Under this topic, IST has investigated the use of inversereinforcement learning as a tool for recovering the mechan...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011